Relation Extraction with Massive Seed and Large Corpora
نویسندگان
چکیده
The research area of information extraction (IE) aims to extract relevant structured information from natural language texts. In addition to the named-entity recognition (NER) task, the identification and classification of relations among entities, namely, the so-called relation extraction (RE) task, is particularly important for many real-world applications. Given the sentence in Figure 1, a RE system should be able to recognize the underlined mentions of entities and their semantic relation, i. e., a marriage.
منابع مشابه
Applying the Espresso-algorithm to large parsed corpora
Information extraction systems learn patterns for extracting pairs instantiating a given relation from text. For instance, for the relation capital of a system might learn extraction patterns such as ‘Arg1 is capital of Arg2’, or ’The embassador of arg2 was called back to Arg1’. Lightly supervised information extraction systems learn extraction patterns by means of a bootstrapping procedure, wh...
متن کاملURES : an Unsupervised Web Relation Extraction System
Most information extraction systems either use hand written extraction patterns or use a machine learning algorithm that is trained on a manually annotated corpus. Both of these approaches require massive human effort and hence prevent information extraction from becoming more widely applicable. In this paper we present URES (Unsupervised Relation Extraction System), which extracts relations fr...
متن کاملAn Efficient Active Learning Framework for New Relation Types
Relation extraction is a fundamental task in information extraction. Different methods have been studied for building a relation extraction system. Supervised training of models for this task has yielded good performance, but at substantial cost for the annotation of large training corpora (About 40K same-sentence entity pairs). Semi-supervised methods can only require a seed set, but the perfo...
متن کاملLarge-Scale Learning of Relation-Extraction Rules with Distant Supervision from the Web
We present a large-scale relation extraction (RE) system which learns grammar-based RE rules from the Web by utilizing large numbers of relation instances as seed. Our goal is to obtain rule sets large enough to cover the actual range of linguistic variation, thus tackling the long-tail problem of real-world applications. A variant of distant supervision learns several relations in parallel, en...
متن کاملExploratory Relation Extraction in Large Text Corpora
In this paper, we propose and demonstrate Exploratory Relation Extraction (ERE), a novel approach to identifying and extracting relations from large text corpora based on user-driven and data-guided incremental exploration. We draw upon ideas from the information seeking paradigm of Exploratory Search (ES) to enable an exploration process in which users begin with a vaguely defined information ...
متن کامل